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(54) DATA PROCESSOR 

(57)Abstract: 

PROBLEM TO BE SOLVED: To improve the throughput of a data 
processor for successively obtaining plural transposed matrixes. 
SOLUTION: This processor is provided with a memory cell array 
200 which arranged in a 4x4 lattice form memory cells 201 each 
of which consists of two data input ports and two data output 
ports in order to obtain each transposed matrix from plural 4x4 
input matrices successively given. For certain five cycles, a column 
of the memory cell array 200 is read and in one cycle in the next 
cycle writing a column or row vector of an input matrix is written in 
the same column which was read, of the memory cell array 200 and 
this operation is repeated. For the next five cycles, a row of the 
memory cell array 200 is read and in one cycle reads in the next 
cycle writing a column or a row of the next input matrix is written 
in the same row which was read of the memory cell array 200 and 
this operation is repeated. Thus, a transposed matrix is obtained 
with a rate of one per five cycles. 
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[Claim(s)] 

[Claim 1] It is a data processor for [ which asks for each transposed matrix respectively 
from two or more input matrices of a n line n train (n is two or more integers) ] having 
been given one by one. The memory cell array which comes to arrange the memory cell of 
the nxn individual which has the 1st and 2nd data input/output port respectively in the 
shape of a grid, n line buses by which common connection was made in each 1st data 
input/output port of n memory cells which constitute one line to which it corresponds of 
said memory cell arrays respectively, Each 2nd data input/output port of n memory cells 
which constitute one train to which it corresponds of said memory cell arrays respectively 
is equipped with n train buses by which common connection was made. Each train of said 
memory cell array is read between a certain n cycles in the first half of each cycle using 
said n line buses. And each line or column vector of an input matrix which it is in the 
second half of each cycle is written in the same train as read-out of said memory cell array. 
Each line of said memory cell array is read between the following n cycles in the first half 
of each cycle using said n train buses. And the data processor characterized by writing 
each following line or following column vector of an input matrix in the same line as 
read-out of said memory cell array in the second half of each cycle. 

[Claim 2] It is a data processor for [ which asks for each transposed matrix respectively 
from two or more input matrices of a n line n train (n is two or more integers) ] having 
been given one by one. The memory cell array which comes to arrange the memory cell of 
the nxn individual which has the 1st, 2nd data input port and the 1st, and 2nd data output 
ports respectively in the shape of a grid, Each 1st data input port of n memory cells which 
constitute one line to which it corresponds of said memory cell arrays respectively, n input 
buses by which common connection was made in each 2nd data input port of n memory 
cells which constitute one train to which it corresponds of said memory cell arrays, Each 
1st data output port of n memory cells which constitute one line to which it corresponds of 
said memory cell arrays respectively, Each 2nd data output port of n memory cells, which 
constitute one train to which it corresponds of said memory cell arrays is equipped with n 
output buses by which common connection was made. In a certain cycle, one train of said 
memory cell array is read between a certain n+1 cycles using said n output buses. It 
repeats writing the line or column vector of an input matrix which has used said n input 
buses in the following cycle in the same train as read-out of said memory cell array. In a 
certain cycle, one line of said memory cell array is read between the n+1 following cycles 
using said n output buses. The data processor characterized by making it repeat writing the 
following line or following column vector of an input matrix in the same line as read-out 
of said memory cell array using said n input buses in the following cycle. 
[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the data processor for asking for each 
transposed matrix from two or more input matrices which are used in various fields, such 
as scientific calculation and an image processing, and which were given one by one. 
[0002] 

[Description of the Prior Art] The thing equipped with the memory cell array which comes 
to arrange the memory cell of the nxn individual which has a data input port and a data 
output port respectively in the shape of a grid as a data processor for asking for a 
transposed matrix from the input matrix of a n line n train (n is two or more integers) is 
known. According to this data processor, in each cycle, each column vector of an input 
matrix is written in for every train of a memory cell array between the first n cycles. And 
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in each cycle, reading appearance of each row vector of an output matrix is carried out for 
evety line of a memory cell array between the following n cycles. An output matrix 
becomes what transposed the input matrix here. In order to ask for the transposed matrix 
of the following input matrix, 2 moren cycle is required. 
[0003] 

[Problem(s) to be Solved by the Invention] According to the above-mentioned 
conventional data processor, 2n cycle was taken to ask for each transposed matrix from 
two or more input matrices given one by one, respectively, and there was a problem that 
processing speed was slow. 

[0004] The object of this invention is to raise the throughput of the data processor for 

asking for two or more transposed matrices one by one. 

[0005] 

[Means for Solving the Problem] In order to attain the above-mentioned object, in this 
invention, the data bus configuration and its address-generation sequence of a memory cell 
array were devised. 

[0006] The 1st data processor concerning this invention specifically It is a data processor 
for [ which asks for each transposed matrix respectively from two or more input matrices 
of a n line n train (n is two or more integers) ] having been given one by one. The memory 
cell array which comes to arrange the memory cell of the nxn individual which has the 1 st 
and 2nd data input/output port respectively in the shape of a grid, n line buses by which 
common connection was made in each 1st data input/output port of n memory cells which 
constitute one line to which it corresponds of these memory cell arrays respectively, The 
configuration which equipped each 2nd data input/output port of n memory cells which 
constitute one train to which it corresponds of said memory cell arrays respectively with n 
train buses by which common connection was made is adopted. And between a certain n 
cycles, using n line buses, each train of a memory cell array is read, and each line or 
column vector of an input matrix which it is in the second half of each cycle was written 
in the same train as read-out of said memory cell array in the first half of each cycle. 
Moreover, between the following n cycles, using n train buses, each line of a memory cell 
array is read, and each following line or following column vector of an input matrix was 
written in the same line as read-out of said memory cell array in the second half of each 
cycle in the first half of each cycle. According to this configuration, a transposed matrix 
can be obtained at a rate of one piece in n cycle. 

[0007] Moreover, the 2nd data processor concerning this invention It is a data processor 
for [ which asks for each transposed matrix respectively from two or more input matrices 
of a n line n train (n is two or more integers) ] having been given one by one. The memory 
cell array which comes to arrange the memory cell of the nxn individual which has the 1 st, 
2nd data input port and the 1st, and 2nd data output ports respectively , in the shape of a 
grid, n input buses by which common connection was made in each 2nd data input port of 
n memory cells which constitute one train to which it corresponds of each 1st data input 
port of n memory cells which constitute one line to which it corresponds of these memory 
cell arrays respectively, and these memory cell arrays, One line to which it corresponds of 
said memory cell arrays respectively The configuration which equipped each 2nd data 
output port of n memory cells which constitute one train to which it corresponds of each 
1st data output port of n memory cells to constitute and these memory cell arrays with n 
output buses by which common connection was made is adopted. And one train of a 
memory cell array is read using n output buses, and it was made to repeat writing the line 
or column vector of an input matrix which has used n input buses in the same train as 
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read-out of said memory cell array by the following cycle in a certain cycle between a 
certain n+1 cycles. Moreover, one line of a memory cell array is read using n output buses, 
and it was made to repeat writing the following line or following column vector of an 
input matrix in the same line as read-out of said memory cell array using n input buses in 
the following cycle by a certain cycle between the n+1 following cycles. According to this 
configuration, a transposed matrix can be obtained at a rate of one piece in n+1 cycle. 
[0008] 

[Embodiment of the Invention] Hereafter, the gestalt of operation of this invention is 
explained, referring to a drawing. 

[0009] Drawing 1 shows the input matrix (4x4 matrices) of the four-line four trains which 
are the examples of a processing object of transposition. Matrix D (k) is the k-th input 
matrix (k is an integer). When you set a line number to i (i= 0, 1,2, 3) and you set a row 
number to j (j= 0, 1, 2, 3), suppose that the element of Matrix D (k) is expressed with d (k) 
and (ij). 

[0010] Drawing 2 shows the example of a configuration of the data processor concerning 
this invention for asking for the transposed matrix from the 4x4 matrix D of drawing 1 (k). 
The column vector of Matrix D (k) shall be given to the data processor of drawing 2 as an 
input vector one by one. Namely, the 1 st column vector which consists of four element d 
(k), (00), d (k) and (10), d (k) and (20), and d (k) and (30) is given first. The 2nd column 
vector which consists of four element d (k), (01), d (k) and (11), d (k) and (21), and d (k) 
and (31) is given to a degree. The 3rd column vector which consists of four element d (k), 

(02) , d (k) and (12), d (k) and (22), and d (k) and (32) is given to a degree. The 4th column 
vector which consists of four element d (k), (03), d (k) and (13), d (k) and (23), and d (k) 
and (33) shall be given to a degree. 

[0011] The data processor of drawing 2 is equipped with the memory cell array 100 which 
comes to arrange 4x4 memory cells 101 which have the 1st and 2nd data input/output port 
respectively in the shape of a grid. When you set a line number to p (p= 0, 1,2, 3) and you 
set a row number to q (q= 0, 1, 2, 3), suppose that each memory cell 101 is expressed with 
a (pq). 

[0012] In drawing 2 , RB0, RBI, RB2, and RB3 carry out four line buses, and CB0, CB1, 
CB2, and CB3 are carrying out the table of the four train buses, respectively. Common 
connection of the line bus RB0 is made in the 1st data input/output port of four memory 
cell a (00) which constitutes the 1st line of the memory cell array 100, a (01), a (02), and a 

(03) . Common connection of the line bus RBI is made in the 1st data input/output port of 
four memory cell a (10) which constitutes the 2nd line of the memory cell array 100, a 

(11) , a (12), and a (13). Common connection of the line bus RB2 is made in the 1st data 
input/output port of four memory cell a (20) which constitutes the 3rd line of the memory 
cell array 100, a (21), a (22), and a (23). Common connection of the line bus RB3 is made 
in the 1st data input/output port of four memory cell a (30) which constitutes the 4th line 
of the memory cell array 100, a (31), a (32), and a (33). Common connection of the train 
bus CB0 is made in the 2nd data input/output port of four memory cell a (00) which 
constitutes the 1st train of the memory cell array 100, a (10), a (20), and a (30). Common 
connection of the train bus CB1 is made in the 2nd data input/output port of four memory 
cell a (01) which constitutes the 2nd train of the memory cell array 100, a (11), a (21), and 
a (31). Common connection of the train bus CB2 is made in the 2nd data input/output port 
of four memory cell a (02) which constitutes the 3rd train of the memory cell array 100, a 

(12) , a (22), and a (32). Common connection of the train bus CB3 is made in the 2nd data 
input/output port of four memory cell a (03) which constitutes the 4th train of the memory 
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cell array 100, a (13), a (23), and a (33). 

[0013] In drawing 2 , SCO, SCI, SC2, and SC3 are carrying out the table of the train 
selection signal for choosing one train to which a line selection signal for SRO, SRI, SR2, 
and SR3 to choose one line to which it corresponds of the memory cell arrays 100 
respectively is respectively corresponded of the memory cell arrays 100, respectively. 
CLK is a clock signal and XCLK is a reversal clock signal. 

[0014] The data processor of drawing 2 is further equipped with the input selector 110 and 
the output selector 111. The input selector 110 supplies given input- vector DI to either the 
line buses RB0-RB3 or the train buses CB0-CB3. An output selector 111 supplies either of 
the vectors by which reading appearance was carried out as an output vector DO on the 
train buses CB [ CB0-] 3 from the vector by which reading appearance was carried out on 
the line buses RB [ RB0-] 3 from the memory cell array 100, or this memory cell array 
100. It is specified by change signal R/C any of the line buses RB0-RB3 and the train 
buses CB0-CB3 the input selector 110 and an output selector 111 should choose. 
[0015] Drawing 3 shows the detail of one memory cell 101 in drawing 2 . The memory 
cell 101 of drawing 3 has 1st data input/output port IOl, the 1st selection input terminal 
SEL1, 2nd data input/output port 102, and the 2nd selection input terminal SEL2. When a 
high-level signal is supplied to the 1st selection input terminal SEL1, 1st data input/output 
port IOl is chosen. And reading appearance of the data is carried out to the high-level 
period of a clock signal (lead signal) CLK from 1st data input/output port IOl, and data 
are written in 1st data input/output port IOl at the high-level period of reversal clock 
signal (light signal) XCLK. When a high-level signal is supplied to the 2nd selection input 
terminal SEL2, 2nd data input/output port 102 is chosen. And reading appearance of the 
data is carried out to the high-level period of a clock signal (lead signal) CLK from 2nd 
data input/output port 102, and data are written in 2nd data input/output port 102 at the 
high-level period of reversal clock signal (light signal) XCLK. In memory cell [ of 
drawing 3 ] a (03), 1st data input/output port IOl is connected to the line bus RB0, 2nd 
data input/output port 102 is connected to the train bus CB3, respectively, the train 
selection signal SC 3 is supplied to the 1st selection input terminal SEL1, and the line 
selection signal SR 0 is supplied to the 2nd selection input terminal SEL2, respectively. 
[0016] Drawing 4 shows the example of a configuration of the control circuit for 
controlling actuation of the data processor of drawing 2 . For 120, as for 1/4 
counting-down circuit and 122, in drawing 4 , an inverter and 121 are [ 4 ** counter and 
123 ] decoders. An inverter 120 generates reversal clock signal XCLK from a clock signal 
CLK. 1/4 counting-down circuit 121 generates change signal R/C which has the frequency 
of the quadrant of the frequency of a clock signal CLK. Generated change signal R/C is 
not only supplied to said input selector 110 and output selector 111, but is supplied to a 
decoder 123. 4 ** counter 122 carries out counting of the pulse of a clock signal CLK, and 
supplies 4 ** counted value CT, 0, 1, and 2,4 or 3 to a decoder 123. [ i.e., ] When change 
signal R/C is high-level, a decoder 123 will supply the high-level train selection signal SC 
3, respectively, if 4 ** counted value CT4 becomes zero, 4 ** counted value CT4 will 
become one about the high-level train selection signal SC 0, 4 ** counted value CT4 will 
become two about the high-level train selection signal SC 1 and 4 ** counted value CT4 
will become three about the high-level train selection signal SC 2. Moreover, when change 
signal R/C is a low level, a decoder 123 will supply the high-level line selection signal SR 
3, respectively, if 4 ** counted value CT4 becomes zero, 4 ** counted value CT4 will 
become one about the high-level line selection signal SR 0, 4 ** counted value CT4 will 
become two about the high-level line selection signal SR 1 and 4 ** counted value CT4 
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will become three about the high-level line selection signal SR 2. 

[0017] Drawing 5 and drawing 6 show actuation of the data processor of drawing 2 . Each 
column vector of an input matrix D (k) is written in for every train of the memory cell 
array 100 in the second half of each cycle using four line buses RB0-RB3 between the 
four cycles from the cycle Tl shown in drawing 5 to cycle T four. That is, the memory cell 
array 100 is filled with all the elements of an input matrix D (k) at cycle T four. And 
between the four cycles from the cycle T5 shown in drawing 6 to a cycle T8, using four 
train buses CB0-CB3, reading appearance of each line of the memory cell array 100 is 
carried out, and it is written in the line as read-out with each column vector of the 
following input matrix D (k+1) same in the second half which is each cycle in the first half 
of each cycle. It is a row vector of an input matrix D (k) respectively, Vector DO, i.e., the 
output vector, by which reading appearance was carried out in each cycle from a cycle T5 
to a cycle T8. Therefore, it means that the transposed matrix of an input matrix D (k) was 
obtained here. In addition, between the four cycles from a cycle Tl to cycle T four, using 
four line buses RB0-RB3, reading appearance of each train of the memory cell array 100 
is carried out, and, according to drawing 5 , it is written in the train as read-out with each 
column vector of an input matrix D (k) same in the second half which is each cycle in the 
first half of each cycle. Four vectors on which the four cycle from a cycle Tl to cycle T 
four was bet and by which reading appearance was carried out constitute the transposed 
matrix of the input matrix D (k-1) in front of an input matrix D (k). 

[0018] In order to ask for each transposed matrix from two or more 4x4 input matrices 
given one by one according to the data processor of drawing 2 the above passage The 
actuation which writes one column vector of an input matrix in the same line or same train 
as the line or train which carried out reading appearance among the memory cell arrays 
1 00 in the first half of a certain cycle in the second half of this cycle is repeated. And since 
the line buses RB0-RB3 and the train buses CB0-CB3 are properly used for every four 
cycle, a transposed matrix can be obtained at a rate of one piece to a four cycle. 
[0019] Drawing 7 shows other examples of a configuration of the data processor 
concerning this invention for asking for the transposed matrix from the 4x4 matrix D of 
drawing 1 (k). The column vector of Matrix D (k) shall also be given to the data processor 
of drawing 7 as an input vector one by one. 

[0020] The data processor of drawing 7 is equipped with the memory cell array 200 which 
comes to arrange 4x4 memory cells 201 which have the 1st, 2nd data input port and the 
1st, and 2nd data output ports respectively in the shape of a grid. When you set a line 
number to p (p= 0 5 1,2, 3) and you set a row number to q (q= 0, 1,2, 3), suppose that each 
memory cell 201 is expressed with a (pq). 

[0021] In drawing 7 , IB0, IB1, IB2, and IB3 carry out four input buses, and alumnusO, 
alumnus 1, alumnus2, and alumnus3 are carrying out the table of the four output buses, 
respectively. Input-vector DI is given to the input buses IB0-IB3, and the output vector 
DO is acquired from the output buses alumnus0-alumnus3. The 1st four data input port 
from which the input bus IB0 constitutes the 1st line of the memory cell array 200, 
memory cell a (00), a (01), a (02), and a (03), Common connection is made in the 2nd data 
input port of four memory cell a (00) which constitutes the 1st train of the memory cell 
array 200, a (10), a (20), and a (30). The 1st four data input port from which the input bus 
IB1 constitutes the 2nd line of the memory cell array 200, memory cell a (10), a (11), a 
(12), and a (13), Common connection is made in the 2nd data input port of four memory 
cell a (01) which constitutes the 2nd train of the memory cell array 200, a (11), a (21), and 
a (31). The 1st four data input port from which the input bus IB2 constitutes the 3rd line of 
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the memory cell array 200, memory cell a (20), a (21), a (22), and a (23), Common 
connection is made in the 2nd data input port of four memory cell a (02) which constitutes 
the 3rd train of the memory cell array 200, a (12), a (22), and a (32). The 1st four data 
input port from which the input bus IB3 constitutes the 4th line of the memory cell array 
200, memory cell a (30), a (3 1), a (32), and a (33), Common connection is made in the 2nd 
data input port of four memory cell a (03) which constitutes the 4th train of the memory 
cell array 200, a (13), a (23), and a (33). The 1st four data output port from which the 
output bus alumnus 0 constitutes the 1st line of the memory cell array 200, memory cell a 

(00) , a (01), a (02), and a (03), Common connection is made in the 2nd data output port of 
four memory cell a (00) which constitutes the 1 st train of the memory cell array 200, a 
(10), a (20), and a (30). The 1st four data output port from which the output bus alumnus 1 
constitutes the 2nd line of the memory cell array 200, memory cell a (10), a (11), a (12), 
and a (13), Common connection is made in the 2nd data output port of four memory cell a 

(01) which constitutes the 2nd train of the memory cell array 200, a (11), a (21), and a (31). 
The 1st four data output port from which the output bus alumnus 2 constitutes the 3rd line 
of the memory cell array 200, memory cell a (20), a (21), a (22), and a (23), Common 
connection is made in the 2nd data output port of four memory cell a (02) which 
constitutes the 3rd train of the memory cell array 200, a (12), a (22), and a (32). The 1st 
four data output port from which the output bus alumnus 3 constitutes the 4th line of the 
memory cell array 200, memory cell a (30), a (3 1), a (32), and a (33), Common connection 
is made in the 2nd data output port of four memory cell a (03) which constitutes the 4th 
train of the memory cell array 200, a (13), a (23), and a (33). 

[0022] In drawing 7 , line reading appearance of RR0, RR1, RR2, and RR3 is carried out 
for controlling read-out of one line to which it corresponds of the each memory cell arrays 
200. a control signal WR0, WR1, WR2, and WR3 the line write-in control signal for 
controlling the writing of one line to which it corresponds of the memory cell arrays 200 
respectively Train reading appearance of RC0, RC1, RC2, and RC3 is carried out for 
controlling read-out of one train to which it corresponds of the each memory cell arrays 
200. a control signal WC0, WC1, WC2, and WC3 are carrying out the table of the train 
write-in control signal for controlling the writing of one train to which it corresponds of 
the memory cell arrays 200 respectively, respectively. 

[0023] Drawing 8 shows the detail of one memory cell 201 in drawing 7 . The 1st write-in 
control terminal WT 1 for the memory cell 201 of drawing 8 to control the writing of the 
data to the 1st data input port INI and this port INI The 1st read-out control terminal RD 1 
for controlling read-out of the data from the 1st data output port OUT1 and this port 
OUT1 The 2nd write-in control terminal WT 2 for controlling the writing of the data to the 
2nd data input port IN2 and this port IN2 It has the 2nd read-out control terminal RD 2 for 
controlling read-out of the data from the 2nd data output port OUT2 and this port OUT2. 
In memory cell [ of drawing 8 ] a (03), the 1st data input port INI into the input bus IB0 
The 2nd data input port IN2 into the output bus alumnus 0 into the input bus IB 3 [the 1st 
data output port OUT1 ] The 2nd data output port OUT2 is connected to the output bus 
alumnus 3, respectively. The train read-out control signal RC 3 for the 1st read-out control 
terminal RD 1 at the 1st write-in control terminal WT 1 the train write-in control signal 
WC3 The line read-out control signal RR0 is supplied to the 2nd read-out control terminal 
RD 2, and the line write-in control signal WR0 is supplied to the 2nd write-in control 
terminal WT 2, respectively. 

[0024] Drawing 9 shows the example of a configuration of the control circuit for 
controlling actuation of the data processor of drawing 7 . For 221, as for a quinary counter 
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and 223, in drawing 9 , 1/5 counting-down circuit and 222 are [ the decoder for writing 
and' 224 ] the decoders for read-out. 1/5 counting-down circuit 221 generates change 
signal R/C which has 1/5 of the frequencies of the frequency of a clock signal CLK. 
Generated change signal R/C is supplied to both the decoders 223,224. The quinary 
counter 222 carries out counting of the pulse of a clock signal CLK, and supplies quinary 
counted value CT, 0, 1, 2, and 3,5 or 4 to both the decoders 223,224. [ i.e., ] When change 
signal R/C is high-level, the decoder 223 for writing If a signal will not be supplied if 
quinary counted value CT5 becomes zero, but quinary counted value CT5 becomes one, 
the high-level train write-in control signal WCO If quinary counted value CT5 becomes 
two, quinary counted value CT5 will, become three about the high-level train write-in 
control signal WC1 and quinary counted value CT5 will become four about the high-level 
train write-in control signal WC2, the high-level train write-in control signal WC3 will be 
supplied, respectively, moreover, when change signal R/C is a low level, the decoder 223 
for writing If a signal will not be supplied if quinary counted value CT5 becomes zero, but 
quinary counted value CT5 becomes one, the high-level line write-in control signal WRO 
If quinary counted value CT5 becomes two, the high-level line write-in control signal 
WR1 If quinary counted value CT5 becomes three and quinary counted value CT5 will 
become four about the high-level line write-in control signal WR2, the high-level line 
write-in control signal WR3 will be supplied, respectively. When change signal R/C is 
high-level, the decoder 224 for read-out If quinary counted value CT5 becomes zero, the 
high-level train read-out control signal RC 0 If quinary counted value CT5 becomes one, 
the high-level train read-out control signal RC 1 If quinary counted value CT5 becomes 
two and quinary counted value CT5 will become three about the high-level train read-out 
control signal RC 2, the high-level train read-out control signal RC 3 will be supplied, 
respectively, and a signal will not be supplied if quinary counted value CT5 becomes four, 
moreover, when change signal R/C is a low level, the decoder 224 for read-out If quinary 
counted value CT5 becomes zero, the high-level line read-out control signal RRO If 
quinary counted value CT5 becomes one, the high-level line read-out control signal RR1 
If quinary counted value CT5 becomes two and quinary counted value CT5 will become 
three about the high-level line read-out control signal RR2, the high-level line read-out 
control signal RR3 will be supplied, respectively, and a signal will be supplied if quinary 
counted value CT5 becomes four. 

[0025] Drawing 10 and drawing 11 show actuation of the data processor of drawing 7 . In 
each cycle from the cycle t2 shown in drawing 10 to a cycle t5, each column vector of an 
input matrix D (k) is written in for every train of the memory cell array 200 using four 
input buses IB0-IB3. That is, the memory cell array 200 is filled with all the elements of 
an input matrix D (k) in a cycle t5. And between 5 cycles from the cycle t6 shown in 
drawing 1111 to a cycle tlO, by a certain cycle, one line of the memory cell array 200 is 
read using four output buses alumnus0-alumnus3, and actuation of writing one column 
vector of the following input matrix D (k+1) in the same line as read-out using four input 
buses IB0-IB3 is repeated in the following cycle. It is a row vector of an input matrix D 
(k) respectively, Vector DO, i.e., the output vector, by which reading appearance was 
carried out in each cycle from a cycle t6 to a cycle t9. Therefore, it means that the 
transposed matrix of an input matrix D (k) was obtained here. In addition, according to 
drawing 10 ., between 5 cycles from a cycle tl to a cycle t5, one train of the memory cell 
array 200 is read using four output buses alumnus0-alumnus3, and actuation of writing 
one column vector of an input matrix D (k) in the same train as read-out using four input 
buses IB0-IB3 is repeated in the following cycle by a certain cycle. Four vectors on which 
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the four cycle from a cycle tl to a cycle t4 was bet and by which reading appearance was 
carried out constitute the transposed matrix of the input matrix D (k-1) in front of an input 
matrix D (k). 

[0026] In order to ask for each transposed matrix from two or more 4x4 input matrices 
given one by one according to the data processor of drawing 7 the above passage writing 
one column vector of an input matrix in the same line or same train as the line or train 
which carried out reading appearance in a certain cycle among the memory cell arrays 200 
in the following cycle Since the actuation which reads the next line or next train of this 
memory cell array 200 in the latter cycle is repeated and the handling of the row and 
column of the memory cell array 200 is changed every 5 cycles, a transposed matrix can 
be obtained at a rate of one piece in 5 cycle. 

[0027] In addition, even when the row vector of Matrix D (k) is given one by one as an 
input vector, it can ask for a transposed matrix with the data processor which has the 
configuration of drawing 2 or drawing 7 . Probably, it will be easy for this contractor to 
transform each above-mentioned example into the data processor for asking for the 
transposed matrix of a nxn matrix, when making n into two or more integers. 
[0028] 

[Effect of the Invention] In order to ask for each transposed matrix from two or more nxn 
input matrices given one by one according to the 1st data processor concerning this 
invention as explained above The actuation which writes one vector of an input matrix in 
the same line or same train as the line or train which carried out reading appearance 
among memory cell arrays in the first half of a certain cycle in the second half of this 
cycle is repeated. And since a line bus and a train bus are properly used for every n cycle, 
a transposed matrix can be obtained at a rate of one piece in n cycle, and the throughput of 
data processing improves compared with the former. 

[0029] Moreover, in order to ask for each transposed matrix from two or more nxn input 
matrices given one by one according to the 2nd data processor concerning this invention 
writing one vector of an input matrix in the same line or same train as the line or train 
which carried out reading appearance in a certain cycle among memory cell arrays in the 
following cycle Since the actuation which reads the next line or next train of this memory 
cell array in the latter cycle is repeated and the handling of the row and column of a 
memory cell array is changed for n+every cycle A transposed matrix can be obtained at a 
rate of one piece in n+1 cycle, and the throughput of data processing improves. 
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